Chapter 2 Multi-arm Bandits

Evaluative feedback is the basis of methods for function optimization, including evolutionary methods.

2.1 A k-armed Bandit Problem